Experiencing Data Grids

Authors

  • Nicolaas Ruberg
  • Nelson Kotowski
  • Amanda S. de Mattos
  • Luciana Matos
  • Melissa Machado
  • Daniel de Oliveira
  • Rafael Monclar
  • Cláudio Ananias Ferraz
  • Talitta Sanchotene
  • Vanessa Braganholo
Abstract

Many scientific experiments involve data-intensive applications and the orchestration of computational workflow activities. These can benefit from data parallelism, exploited in parallel systems to minimize execution time. Because of their complexity, robustness and efficiency in exploiting data parallelism, grid infrastructures are widely used in some e-Science areas such as bioinformatics. Workflow techniques are very important to in-silico bioinformatics experiments, allowing the e-scientist to describe and enact experimental processes in a structured, repeatable and verifiable way. The main purpose of this paper is to describe our experience with Taverna Workbench and PeDRo, which are part of the myGrid project. Taverna provides a workflow toolset and enactor, allowing the specification of processing units, data transfer and execution constraints. As a data entry tool, PeDRo provides a model, a controlled vocabulary and field validations for Web Services descriptions, leveraging the knowledge associated with the workflows. The main contribution of this work is a summary of considerations drawn from our experience with these tools, emphasizing their advantages and drawbacks, together with proposals for future improvements.

Similar resources

Models, Methodologies, and Applications

High-performance Grid platforms and parallel computing technologies are experiencing their golden age because of the convergence of four critical trends: high-performance microprocessors, high-speed networks, free middleware tools, and greatly increased demand for computing capability. We are witnessing the rapid development of computational Grid technologies. Dozens of exciting Grid infrastruct...

G-Monitor: Gridbus web portal for monitoring and steering application execution on global grids

Grids are experiencing a rapid growth in their application and along with this there is a requirement for a portal which is easy to use and scalable. We have responded to this requirement by developing an easy to use, scalable, web-based portal called G-Monitor. This paper proposes a generic architecture for a web portal into a grid environment and discusses our implementation and its application.

Cascade failures and distributed generation in power grids

Power grids are currently undergoing a transformation due to the introduction of Distributed Generation based on Renewable Sources. Unlike classical Distributed Generation, where local power sources mitigate anomalous user consumption peaks, Renewable Sources introduce intrinsically erratic power inputs into the grid. By introducing a simple schematic (but realistic) model for power ...

Equidistribution grids for two-parameter convection–diffusion boundary-value problems

In this article, we propose an adaptive grid based on mesh equidistribution principle for two-parameter convection-diffusion boundary value problems with continuous and discontinuous data. A numerical algorithm based on an upwind finite difference operator and an appropriate adaptive grid is constructed. Truncation errors are derived for both continuous and discontinuous problems. Parameter uni...

Improving Data Grids Performance by Using Modified Dynamic Hierarchical Replication Strategy

A Data Grid connects a collection of geographically distributed computational and storage resources, enabling users to share data and other resources. Data replication, a technique much discussed by Data Grid researchers in recent years, creates multiple copies of a file and places them in various locations to shorten file access times. In this paper, a dynamic data replication strate...

Improving Mobile Grid Performance Using Fuzzy Job Replica Count Determiner

Grid computing refers to the combination of computer resources from multiple administrative domains to form a common computational platform. Mobile computing is a generic term for the use of portable, handheld devices with wireless communication to process data. Mobile computing focuses on providing access to data, information, services and communications anywhere an...

Publication date: 2006